Discriminative bag-of-cells for imaging-genomics.
نویسندگان
چکیده
Connecting genotypes to image phenotypes is crucial for a comprehensive understanding of cancer. To learn such connections, new machine learning approaches must be developed for the better integration of imaging and genomic data. Here we propose a novel approach called Discriminative Bag-of-Cells (DBC) for predicting genomic markers using imaging features, which addresses the challenge of summarizing histopathological images by representing cells with learned discriminative types, or codewords. We also developed a reliable and efficient patch-based nuclear segmentation scheme using convolutional neural networks from which nuclear and cellular features are extracted. Applying DBC on TCGA breast cancer samples to predict basal subtype status yielded a class-balanced accuracy of 70% on a separate test partition of 213 patients. As data sets of imaging and genomic data become increasingly available, we believe DBC will be a useful approach for screening histopathological images for genomic markers. Source code of nuclear segmentation and DBC are available at: https://github.com/bchidest/DBC.
منابع مشابه
Supervised Topic Models for Video Activity Recognition
Topic models successfully capture latent structure useful for unsupervised analysis of bag-of-words data. Applying these models to domains such as video activity recognition requires two critical extensions: (1) incorporating supervised information (activity labels) to recover topic structure with greater discriminative power and (2) moving beyond the bag-of-words assumption to model temporal d...
متن کاملMulti-instance Learning with Discriminative Bag Mapping
Multi-instance learning (MIL) is a useful tool for tackling labeling ambiguity in learning because it allows a bag of instances to share one label. Bag mapping transforms a bag into a single instance in a new space via instance selection and has drawn significant attention recently. To date, most existing work is based on the original space, using all instances for bag mapping, and the selected...
متن کاملINAOE's Participation at PAN'15: Author Profiling task
In this paper, we describe the participation of the Language Technologies Lab of INAOE at PAN 2015. According to the Author Profiling (AP) literature. In this paper we take such discriminative and descriptive information into a new higher level exploiting a combination of discriminative and descriptive representations. For this we use dimensionality reduction techniques on the top of typical di...
متن کاملژنومیکس انگل ها
Genes carry instructions to make protein that affect body's cells and their physical activity. They also play an important role in the occurrence of various characteristics in the body. Recently, scientists in the new field of science known as genomics have studied the genetic instructions. Genomics deals with the discovery of all the sequences in the entire genome of organisms and is used to s...
متن کاملOnm-21: General Principles of Collecting and Storing Cord Blood Stem Cell
Cord blood is the blood that remains in the umbilical cord and placenta following birth, which is usually discarded It contains red blood cells, white blood cells, platelets, and plasma, like blood. In addition, cord blood is a rich source of stem cells that may have potentially lifesaving benefits for your baby and family. The cord blood of baby serves as an abundant source of stem cells. Thes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
دوره 23 شماره
صفحات -
تاریخ انتشار 2018